1 Introduction

In searches for physics beyond the Standard Model (SM) at the Large Hadron Collider (LHC), quantum effects of virtual new heavy particles in high-energy observables play a crucial role. In the framework of the Standard Model effective field theory (SMEFT), such effects can be described by effective couplings that modify and extend the interactions of SM particles at energies within the reach of the LHC [1,2,3]. Extensions of the Standard Model at high energies, referred to as UV (‘ultraviolet’) theories, typically leave a pattern of effects in several observables. To pin down the structure of a UV theory, sensitive observables that probe complementary directions in the large parameter space of effective couplings have to be combined in a global analysis.

Top-quark physics at the LHC is very well suited to deciphering possible UV theories, both in sensitivity and in complementarity. Top-quark pair production provides a large number of precise observables, allowing the detection of even small deviations from SM predictions in the presence of new physics. Among these observables, the charge asymmetry, based on the different probability for top and antitop quarks to be emitted in a considered phase-space region, is particularly sensitive to new physics. In QCD, the charge asymmetry in \(t{{\bar{t}}}\) production first occurs at next-to-leading order (NLO) [4]; in \(t{{\bar{t}}} j\) production it is induced at tree level due to the presence of the additional jet [5,6,7,8]. Contributions from new particles can lead to significant modifications of the asymmetry compared to the SM prediction [9,10,11,12].

At the LHC, the charge asymmetry has been measured as a rapidity asymmetryFootnote 1 in top-quark pair production and is found to be in good agreement with the SM prediction [13,14,15]. Alternatively, the charge asymmetry can be measured as an energy asymmetry in top-quark pair production in association with a high transverse momentum (high \(p_{\text {T}} \)) jet [16]. The energy asymmetry is an important test of the charge asymmetry at leading order in QCD. By construction, the energy asymmetry is sensitive to the charge asymmetry in a different phase-space region than the rapidity asymmetry. In combination, the two asymmetries are therefore powerful probes of physics beyond the Standard Model in SMEFT. In this article, the energy asymmetry is measured with the \(13\,{\mathrm {TeV}}\) proton-proton collision data collected by the ATLAS experiment during Run 2 in 2015-2018 of the LHC.

The energy asymmetry in \(t\bar{t} {j} \) production is defined as

$$\begin{aligned} A_E(\theta _j) \equiv \frac{ \sigma ^\text {opt}(\theta _j | \Delta {E}> 0) - \sigma ^\text {opt}(\theta _j | \Delta {E}< 0)}{ \sigma ^\text {opt}(\theta _j | \Delta {E} > 0) + \sigma ^\text {opt}(\theta _j | \Delta {E} < 0) }\,, \end{aligned}$$
(1)

with the optimised cross section

$$\begin{aligned} \sigma ^\text {opt}(\theta _j)&= \sigma (\theta _j | y_{t{\bar{t}}j} > 0) + \sigma (\pi - \theta _j | y_{t\bar{t} {j}} < 0)\,,\nonumber \\&\qquad \theta _j \in [0,\pi ]\,, \end{aligned}$$
(2)

where \(\sigma (\theta _j)\) is the differential cross section as a function of \(\theta _j\), and \(y_{t\bar{t} {j}}\) is the rapidity of the \(t\bar{t} {j}\) system.

To measure the energy asymmetry the difference of the top and antitop energies \(\Delta {E} = E_t-E_{{{\bar{t}}}}\) is determined as a function of the jet angle \(\theta _j \). Both \(\Delta {E} \) and \(\theta _j \) are defined in the \(t\bar{t} {j}\) rest frame, which corresponds to the partonic centre-of-mass frame in tree-level processes. The angle \(\theta _j \) is defined as the angle between the jet direction and the positive z-axis, i.e., the direction of parton \(p_1\) in the process \(p_1 p_2 \rightarrow t\bar{t} {j} \). The orientation of the z-axis can be chosen in either direction along the beam line, due to the symmetry in proton-proton collisions. The energy asymmetry is mainly generated in the partonic process \(q g \rightarrow t{\bar{t}} q\), as illustrated in Fig. 1. The outgoing quark-jet is boosted in the direction of the incoming valence quark. This boost is reflected in the rapidity of the \(t\bar{t} {j}\) system in the laboratory frame, \(y_{t{{\bar{t}}} j}\). By combining ‘forward’ events having \(y_{t\bar{t} {j}} > 0\) with ‘backward’ events having \(y_{t\bar{t} {j}} < 0\) in the optimised cross section \(\sigma ^\text {opt}(\theta _j)\), the statistical sensitivity to the energy asymmetry is optimised [17, 18].

Fig. 1
figure 1

Examples of contributions to the energy asymmetry in terms of Feynman diagrams. The double lines indicate top quarks; the orange filled circles stand for possible insertions of four-quark operators in SMEFT. The dashed line symbolizes the interference \({\mathcal {M}}_1 {\mathcal {M}}_2^*\) of the two \(qg \rightarrow t\bar{t} {q}\) amplitudes \({\mathcal {M}}_1\) and \({\mathcal {M}}_2^*\) to its left and right sides. Contributions to the asymmetry in QCD are obtained by replacing the orange filled circles in the left diagram by gluons. Further details about the QCD prediction and SMEFT contributions can be found in Ref. [18] and Ref. [19], respectively

The energy asymmetry in \(t\bar{t} {j} \) production is measured in three bins of the jet angle \(\theta _j\) in a fiducial phase space defined at particle level. The analysis selects events with a high-\(p_{\text {T}} \) jet, one leptonic W decay from one of the top quarks (called in the following the leptonically decaying top quark) and one hadronic W decay from the other top quark (called in the following the hadronically decaying top quark). The decay products of the hadronically decaying top are required to be collimated in one large-radius jet, as is characteristic of the boosted regime. By focusing on this boosted regime, the additional jet is easily distinguished from the top-quark decay products. Moreover, the energy asymmetry increases with the transverse momentum of the associated jet [18].

The energy asymmetry is highly sensitive to the chirality of the top and antitop quarks [19], which allows the detection of even small potential deviations from the SM prediction. In SMEFT, such effects are described by four-quark operators that involve quarks with different chiralities. The detection of the additional jet in \(t{{\bar{t}}} j\) production modifies the relative contributions of effective operators to the observables. As a consequence, observables in \(t\bar{t} {j} \) production probe the parameter space of effective couplings in new directions beyond those sensitive to the rapidity asymmetry and other \(t\bar{t} \) observables [19]. The energy asymmetry is therefore a valuable new observable in global SMEFT fits [20,21,22,23,24], and thus the results of this measurement are interpreted in SMEFT to derive bounds on four-quark operator contributions.

This paper is organised as follows: a short description of the ATLAS detector is given in Sect. 2, followed by the object definitions at detector and particle level in Sect. 3; event selection and reconstruction are discussed in Sect. 4, while the simulated samples used to model signal and background events are described in Sect. 5. Sections 6 and 7 deal with the unfolding method and the considered systematic uncertainties, respectively; the measurement results are presented in Sect. 8 and interpreted in the context of SMEFT in Sect. 9. Section 10 provides a summary of the analysis.

2 ATLAS detector

The ATLAS detector [25] at the LHC covers nearly the entire solid angle around the collision point.Footnote 2 It consists of an inner tracking detector surrounded by a thin superconducting solenoid, electromagnetic and hadron calorimeters, and a muon spectrometer incorporating three large superconducting air-core toroidal magnets.

The inner-detector system (ID) is immersed in a \(2\,\hbox {T}\) axial magnetic field and provides charged-particle tracking in the range \(|\eta | < 2.5\). The high-granularity silicon pixel detector covers the vertex region and typically provides four measurements per track, the first hit normally being in the insertable B-layer installed before Run 2 [26, 27]. It is followed by the silicon microstrip tracker, which usually provides eight measurements per track. These silicon detectors are complemented by the transition radiation tracker, which enables radially extended track reconstruction up to \(|\eta | = 2.0\). It also provides electron identification information based on the fraction of hits (typically 30 in total) above a higher energy-deposit threshold corresponding to transition radiation.

The calorimeter system covers the pseudorapidity range \(|\eta | < 4.9\). Within the region \(|\eta |< 3.2\), electromagnetic calorimetry is provided by barrel and endcap high-granularity lead/liquid-argon (LAr) calorimeters, with an additional thin LAr presampler covering \(|\eta | < 1.8\) to correct for energy loss in material upstream of the calorimeters. Hadron calorimetry is provided by the steel/scintillator-tile calorimeter, segmented into three barrel structures within \(|\eta | < 1.7\), and two copper/LAr hadron endcap calorimeters. The solid angle coverage is completed with forward copper/LAr and tungsten/LAr calorimeter modules optimised for electromagnetic and hadronic energy measurements respectively.

The muon spectrometer (MS) comprises separate trigger and high-precision tracking chambers measuring the deflection of muons in a magnetic field generated by the superconducting air-core toroidal magnets. The field integral of the toroids ranges between 2.0 and \({6.0}\,\hbox {Tm}\) across most of the detector. A set of precision chambers covers the region \(|\eta | < 2.7\) with three layers of monitored drift tubes, complemented by cathode-strip chambers in the forward region, where the background is highest. The muon trigger system covers the range \(|\eta | < 2.4\) with resistive-plate chambers in the barrel, and thin-gap chambers in the endcap regions.

Interesting events are selected by the first-level trigger system implemented in custom hardware, followed by selections made by algorithms implemented in software in the high-level trigger [28]. The first-level trigger accepts events from the \(40\,\hbox {MHz}\) bunch crossings at a rate below \({100}\,\hbox {kHz}\), which the high-level trigger reduces in order to record events to disk at about \({1}\,\hbox {kHz}\).

An extensive software suite [29] is used in the reconstruction and analysis of real and simulated data, in detector operations, and in the trigger and data acquisition systems of the experiment.

3 Object definition

This section defines the final state objects at detector and particle levels. The particle-level objects are used to define the fiducial phase space for the measurement.

3.1 Detector-level objects

The primary vertex of an event is the vertex which has the highest sum of squared transverse momenta from associated tracks with \(p_{\text {T}} >0.5\,{\mathrm {GeV}}\) [30]. The primary vertex is required to have at least two tracks.

Electron candidates are reconstructed from clusters of energy deposits in the electromagnetic calorimeter that are matched to a track in the ID [31]. They are required to have a transverse energy greater than \(25\,{\mathrm {GeV}}\) and \(|\eta |<2.47\). Electron candidates are excluded if their calorimeter energy clusters lie within the transition region between the barrel and the endcap of the electromagnetic calorimeter (\(1.37<|\eta |<1.52\)). Electron tracks must satisfy \(|z_0 \sin {\theta }|<0.5\,{\mathrm {mm}}\) and \(|d_0|/\sigma (d_0)<5\), where \(z_0\) is the longitudinal impact parameter relative to the reconstructed primary vertex, \(d_0\) is the transverse impact parameter relative to the beam axis, and \(\sigma (d_0)\) is the uncertainty in \(d_0\). ‘Baseline’ electron candidates are required to satisfy the ‘Loose’ likelihood-based quality criterion of a multivariate algorithm [32] based on shower shape and track selection requirements. ‘Tight’ electron candidates are required to satisfy the ‘Tight’ quality criterion and to pass \(p_{\text {T}} \)- and \(\eta \)-dependent isolation requirements based on their tracks and energy clusters, defined for the ‘Tight’ isolation working point [32].

Muons are reconstructed by combining track information from the MS with tracks found in the ID. ‘Baseline’ muon candidates must satisfy \(p_{\text {T}} >25\,{\mathrm {GeV}}\), \(|\eta |<2.5\) and the ‘Medium’ identification criterion defined in Ref. [33]. This criterion uses requirements on the number of hits in the various ID and MS subsystems and the q/p compatibility between the MS and ID measurements. In addition, the track associated with the muon candidate must have \(|z_0 \sin {\theta }|<0.5\,{\mathrm {mm}}\) and \(|d_0|/\sigma (d_0)<3\). ‘Tight’ muon candidates are furthermore required to be isolated from additional activity in the event according to the ‘Tight’ isolation definition using particle-flow objects [33].

Small-radius (small-\(R\)) jets are reconstructed from particle-flow objects [34], using the anti-\(k_{t}\) jet clustering algorithm [35] as implemented in the FastJet package [36], with a radius parameter of \(R=0.4\). The impact of additional \(pp\) interactions (pile-up) is mitigated by using the jet-area method to correct the jet \(p_{\text {T}}\) [37]. Jets are calibrated to ensure a consistent jet energy scale and resolution between data and simulation as described in Ref. [38]. Only jet candidates with \(p_{\text {T}} >25\,{\mathrm {GeV}}\) and \(|\eta |<2.5\) are considered in this analysis. To mitigate the impact of pile-up, an additional selection criterion using a likelihood-based ‘jet-vertex-tagger’ (JVT) discriminant is applied to jets with \(p_{\text {T}} <60\,{\mathrm {GeV}}\) and \(|\eta |<2.4\) [39].

Small-\(R\) jets containing b-hadrons (b-jets) are identified (b-tagged) using the deep neural network (DNN) based algorithm, DL1r [40]. This neural network is based on distinctive features of b-hadrons in terms of the impact parameters of tracks and the displaced vertices reconstructed in the inner detector. The inputs of the DL1r network also include discriminating variables constructed by a recurrent neural network, which exploits the spatial and kinematic correlations between tracks originating from the same b-hadron. A selection requirement on the DNN discriminant output is used to identify b-jets. The chosen value provides an \(85\%\) average efficiency for identifying b-jets in simulated \(t\bar{t}\) events and c-jet and light-flavour jet rejection factors of 2.6 and 29, respectively.

Large-radius (large-\(R\)) jets are reconstructed with the anti-\(k_{t}\) algorithm from locally calibrated topological cell clusters [41] with radius parameter \(R=1.0\), and then trimmed using parameters \(R_\text {sub}=0.2\) and \(f_\text {cut}=0.05\) [42]. The jet mass resolution is improved by combining the jet mass measurement in the calorimeter with the measurement of the charged component of the jet within the ID [43]. The \(p_{\text {T}}\) and mass calibrations of large-\(R\) jets are based on \(13\,{\mathrm {TeV}}\) data and simulation [43]. Only large-\(R\) jets with \(|\eta |<2.0\) are selected. A large-\(R\) jet is considered to be b-tagged if there is at least one b-tagged small-\(R\) jet within \(\Delta R = 1.0\) of its axis.

Large-\(R\) jets containing top quarks are identified (top-tagged) with a DNN using jet substructure variables [44] as input. The top-tagger has an efficiency of 80% for large-\(R\) jets that are matched to a particle-level jet, which is in turn matched to a ‘truth’ top-quark in the MC generator’s event record, that has jet mass above \(140\,{\mathrm {GeV}}\) and at least one ghost-associated [37] b-hadron. The rejection factor for jets not matched to a top quark ranges from \(\approx 10\) to \(\approx 50\) for large-\(R\) jets with transverse momenta of \(2000\,{\mathrm {GeV}}\) and \(350\,{\mathrm {GeV}}\), respectively. To avoid correlations with additional analysis-specific selection criteria (see Sect. 4), b-tagging information is not included as input to the DNN tagger.

Ambiguities can arise from the independent reconstruction of electron, muon and jet candidates in the detector. A sequential procedure (overlap removal) is applied to avoid double-counting of physics objects, using the same algorithm as in Ref. [45].

The missing transverse momentum vector \(\, \vec {E}_{{\mathrm {T}}}^{{\mathrm {miss}}} \) (with magnitude \(E_{\text {T}}^{\text {miss}}\)) is calculated from the negative vector sum of the \(p_{\text {T}}\) of all reconstructed electrons, muons and small-\(R\) jets after overlap removal, as well as the \(p_{\text {T}}\) of soft activity comprising tracks not associated with those reconstructed objects [46].

3.2 Particle-level objects

Particle-level objects are defined according to ATLAS common practices for ‘truth’ particle observable definitions as outlined in Ref. [47]. These are specified as close as possible to the definition used for experimentally reconstructed objects using only physical particles that enter the detector to minimise model-dependent corrections. Particle-level objects in simulated \(t\bar{t}\) events are defined using quasi-stable particles with a mean lifetime \(\tau > 30\,\text {ps}\). They are selected after hadronisation but before the interaction of these particles with the detector components. Particles from pile-up effects are not included.

Electrons and muons must originate (either directly or from leptonic \(\tau \) decays) from the W bosons from top-quark decays, without requiring a direct match with the parent boson, excluding electrons and muons from hadron decays. The 4-momenta of the ‘truth’ charged leptons are modified (‘dressed’) by adding the 4-momenta of all radiated photons within a cone of size \(\Delta R=0.1\) around the lepton, excluding photons from hadron decays, to take into account final-state photon radiation. Small-\(R\) and large-\(R\) particle-level jets are reconstructed with the same jet algorithms as at detector level using all stable particles, but excluding the charged leptons and neutrinos not from hadron decays and also the photons used to dress the charged leptons. The b-hadrons with \(p_{\text {T}} >5\,{\mathrm {GeV}}\) are included in the clusters forming stable-particle jets with their energies set to a negligible positive value (ghost-association) [37]. Particle-level small-\(R\) jets containing one or more of these b-hadrons are considered to originate from a b-quark and are called b-flavoured at particle level. A particle-level large-\(R\) jet is considered to be b-flavoured if there is at least one b-flavoured small-\(R\) jet within \(\Delta R = 1.0\) of its axis. A particle-level large-\(R\) jet is considered to be top-labelled if it is b-flavoured and the jet mass satisfies is greater than \(140\,{\mathrm {GeV}}\). Charged leptons and small-\(R\) jets must have \(p_{\text {T}} >25\,{\mathrm {GeV}}\) and \(|\eta |<2.5\). The particle-level missing transverse momentum is defined as the vector sum of the transverse momenta of all neutrinos found in the simulation history of the event, excluding those originating from hadron decays.

Particle-level objects are subject to an overlap removal procedure similar to that applied for the detector-level objects as described in Ref. [45].

4 Event selection and reconstruction

This analysis focuses on events with one leptonically and one hadronically decaying top quark in the boosted topology plus an additional hard jet. In this topology, almost all decay products of the hadronically decaying top quark are expected to be collimated into a single large-\(R\) jet. The signature for the leptonically decaying top quark is an isolated charged lepton (electron or muon), missing transverse momentum and a b-tagged jet.

Data used in this analysis were collected by the ATLAS detector between 2015 and 2018 at \(\sqrt{s}=13~\text {Tev}\). Only events for which all detector subsystems were operational are considered. The data set corresponds to an integrated luminosity of \(139\,{\mathrm {fb}}^{-1}\).

Only events passing a single-electron or single-muon trigger [48, 49] are considered. The lowest-threshold triggers use isolation requirements to reduce the trigger rate. These have \(p_{\text {T}} \) thresholds of \(20\,{\mathrm {GeV}}\) for muons and \(24\,{\mathrm {GeV}}\) for electrons in 2015 data, and \(26\,{\mathrm {GeV}}\) for both lepton types in 2016, 2017 and 2018 data. They are complemented by additional triggers with a higher \(p_{\text {T}}\) threshold of \(50\,{\mathrm {GeV}}\) but no isolation requirements in order to increase the event acceptance.

The following event selection criteria were optimised to yield the highest expected sensitivity to the energy asymmetry for jet emission perpendicular to the beam axis. If not stated otherwise, the same criteria are applied for detector-level objects and particle-level objects:

  • Lepton candidate (\(\ell \)) – Exactly one tight lepton (muon or electron) with \(p_{\text {T}} > 27\,{\mathrm {GeV}}\) is required. This lepton is considered the candidate to be the one originated by the top quark via the W decay and is therefore called the lepton candidate (\(\ell \)). Events containing additional baseline leptons with \(p_{\text {T}} >25\,{\mathrm {GeV}}\) are rejected. At particle level, exactly one particle-level lepton with \(p_{\text {T}} > 27\,{\mathrm {GeV}}\) is required, and events containing an additional lepton with \(p_{\text {T}} >25\,{\mathrm {GeV}}\) are rejected.

  • \(E_{\text {T}}^{\text {miss}} \) and \(W \) boson transverse mass (\(\, m_{{\mathrm {T}}}^{W} \)) – To suppress events containing a fake or non-prompt lepton at detector level, it is required that \(E_{\text {T}}^{\text {miss}} \ge 20\,{\mathrm {GeV}}\) and \(E_{\text {T}}^{\text {miss}} + \, m_{{\mathrm {T}}}^{W} \ge 60\,{\mathrm {GeV}}\).Footnote 3 At particle level, no \(E_{\text {T}}^{\text {miss}} \) or \(\, m_{{\mathrm {T}}}^{W} \) cuts are applied.

  • Hadronic top candidate (\(j_h\)) – At least one top-tagged large-\(R\) jet fulfilling \(p_{\text {T}} > 350\,{\mathrm {GeV}}\), and which is separated from the lepton, \(\Delta \phi (j_h,\ell ) > 1.0\), is required. From the large-\(R\) jets fulfilling these conditions, the hadronic top candidate is defined as the one with the highest \(p_{\text {T}}\). At particle level, the top-tagging criterion is replaced by the top-labelling criterion defined in Sect. 3.2, while the kinematic requirements remain unchanged.

  • Leptonic top \({\varvec{b}}\)-jet candidate (\(j_l\)) – At least one small-\(R\) jet close to the lepton candidate, \(\Delta R(j_l,\ell ) < 2.0\), and isolated from the hadronic top candidate, \(\Delta R(j_l,j_h) > 1.5\), is required. From the small-\(R\) jets fulfilling these conditions, the leptonic top b-jet candidate is defined as the highest-\(p_{\text {T}}\) identified b-jet. If no identified b-jet fulfils these conditions, the highest-\(p_{\text {T}}\) small-\(R\) jet is taken.

  • Associated jet candidate (\(j_a\)) – At least one small-\(R\) jet with \(p_{\text {T}} > 100\,{\mathrm {GeV}}\) separated from the large jet, \(\Delta R(j_a,j_h) > 1.5\), and the lepton, \(\Delta R(j_a,\ell ) > 0.4\), and different from the leptonic jet candidate is required. From the small-\(R\) jets fulfilling these conditions, the associated jet candidate is defined as the one with the highest \(p_{\text {T}}\). The \(p_{\text {T}} \) cut was chosen such as to optimise the sensitivity to the energy asymmetry in the central \(\theta _j \) bin, which includes events where the jet is emitted perpendicular to the beam axis and the asymmetry is largest. The optimisation is a trade-off between increased asymmetries for higher \(p_{\text {T}} \) values [18] and increased statistical precision for smaller \(p_{\text {T}} \) values.

  • Identified \({\varvec{b}}\)-jet – At least one small-\(R\) jet is required to be identified as a b-jet. The b-jet must either be the leptonic jet candidate \(j_l\) or be close to the hadronic top candidate \(j_h\) within \(\Delta R < 1.0\). If two or more small-\(R\) jets are identified as b-jets, the leptonic jet candidate \(j_l\) must be identified as a b-jet and the hadronic top candidate must be close to an identified b-jet.

The missing transverse momentum is used as the estimate of the transverse components of the neutrino momentum. The longitudinal component of the neutrino momentum is obtained by requiring that the invariant mass of the lepton–neutrino system equals the W boson mass. If the resulting quadratic equation has more than one real solution, the solution with the smaller absolute longitudinal momentum is taken. If no real solution is found, the \(E_{\text {T}}^{\text {miss}} \) vector is varied minimally in length and direction to obtain one real solution.

The W boson candidate from the leptonic top quark is defined as the 4-momentum sum of the lepton candidate and the reconstructed neutrino. The leptonic top candidate is defined as the 4-momentum sum of the lepton candidate \(\ell \), reconstructed neutrino, and leptonic jet candidate \(j_l\). The charge of the leptonic top candidate is given by the lepton charge, and the opposite charge is assigned to the hadronic top candidate.

Signal events that pass the selection at both the particle level and the detector level are classified as fiducial signal, while events that pass the detector-level selection but not the particle-level selection are classified as non-fiducial signal.

5 Signal and background modelling

All signal and background processes are modelled using Monte Carlo (MC) simulations, with the exception of backgrounds with fake or non-prompt leptons, which are estimated from data (see Sect. 5.2.2). All simulated samples used EvtGen 1.6.0 [50] to model the decays of heavy hadrons, with the exception of the background samples generated with Sherpa [51]. Most of the MC samples were passed through a full simulation of the detector response with the Geant4 toolkit [52, 53]. The samples used to estimate modelling systematic uncertainties are either obtained by reweighting the default full simulation samples, or produced using the fast simulation software Atlfast-II [54], where sample comparisons are always performed using the same type of detector simulation. To model additional pp interactions from the same or neighbouring bunch crossings, the hard scattering events were overlaid with a set of minimum-bias interactions generated using Pythia 8 [55] and the MSTW2008lo [56] parton distribution function (PDF) set with a set of tuned parameters called the A3 [57] tune. Finally, the simulated MC events are reconstructed using the same software as the data.

5.1 Signal modelling

All \(t\bar{t}\) samples are normalised to the inclusive production cross section calculated by Top++ 2.0 [58] at next-to-next-to-leading order (NNLO) in QCD including the resummation of next-to-next-to-leading-logarithm (NNLL) soft-gluon terms [59,60,61,62,63,64]. This corresponds to \(\sigma (t\bar{t}) = {\mathrm {832\pm 51~pb}}\) at a top-quark mass of \(m_t = 172.5\,{\mathrm {GeV}}\), which is assumed for all \(t\bar{t}\) samples unless stated otherwise. The uncertainties in the cross section due to the choice of PDF and \(\alpha _{\text {s}} \) are calculated using the PDF4LHC prescription [65] with the MSTW2008nnlo 68% CL [56, 66], CT10nnlo [67, 68] and NNPDF2.3 5f FFN [69] PDF sets; these uncertainties are added in quadrature to the effect of the scale uncertainty.

The nominal \(t\bar{t}\) events were generated with the Powheg Box [70,71,72,73] v2 generator which provides matrix elements (MEs) at NLO in the strong coupling constant \(\alpha _{\text {s}}\), with the NNPDF3.0nlo [74] PDF and the \(h_{\mathrm {damp}}\) parameterFootnote 4 set to \(1.5 m_t\) [75]. The functional form of the renormalisation and factorisation scales (\(\mu _{\mathrm {r}}\) and \(\mu _{\mathrm {f}}\)) was set to the nominal scale of \(\sqrt{m_{t}^2 + p_{\text {T}}^2}\). The events were interfaced with Pythia 8.230 for the parton shower (PS) and hadronisation, using the A14 tune [76] and the NNPDF2.3lo PDF set.

Various alternative \(t\bar{t} \) samples are used to estimate the systematic uncertainties associated with the modelling of \(t\bar{t} \) production. To simulate more (less) initial-state radiation (ISR), \(\mu _r \) and \(\mu _f \) were varied independently by a factor of 0.5 (2.0) in the ME and the up (down) Var3c variations [76] of the A14 tune were used. Samples with the renormalisation scale for QCD emission in final-state radiation (FSR) altered by a factors of 0.5 and 2.0 were used as well. To test the dependence of the measurement on the PS and hadronisation model, Powheg Box was interfaced with Herwig 7.04 [77, 78] instead of Pythia 8.230, using the H7UE tune [78] and the MMHT2014lo PDF set [79]. The uncertainty in the matching between the ME and PS is evaluated with a sample similar to the nominal \(t\bar{t}\) signal sample but where the \(h_{\mathrm {damp}}\) parameter is set to \(3 m_t\), as well as a sample based on MadGraph5_aMC@NLO (referred to as MG5_aMC in the following) [80] interfaced with Pythia 8. The calculation of the hard-scattering uses MG5_aMC  2.6.0 with the NNPDF3.0nlo PDF set. Events were interfaced with Pythia 8.230, using the A14 tune and the NNPDF2.3lo PDF. The shower starting scale has the functional form \(\mu _{\mathrm q} = H_{\mathrm T}/2\) [81], where \(H_{\mathrm T}\) is defined as the scalar sum of the \(p_{\text {T}}\) of all outgoing partons. The choice for \(\mu _{\mathrm {r}} \) and \(\mu _{\mathrm {f}} \) is the same as used with Powheg Box. Two dedicated samples with the same settings as in the nominal \(t\bar{t}\) signal sample (Powheg Box + Pythia 8) have \(m_t\) set to either \(172\,{\mathrm {GeV}}\) or \(173\,{\mathrm {GeV}}\) to study the impact of mass uncertainties.

The \(t\bar{t} {j}\) events for the SM and SMEFT predictions used in Sects. 8 and 9 were generated with MG5_aMC interfaced with Pythia 8. The calculation of the hard-scattering uses MG5_aMC  2.7.2 with the NNPDF3.0nlo PDF set. Events were interfaced with Pythia 8.244 for PS and hadronisation, using the A14 tune and the NNPDF2.3lo PDF set. The renormalisation and factorisation scales were chosen as \(\mu _{\mathrm {r}} =\mu _{\mathrm {f}} = m_t\). The SM prediction is obtained from matrix elements at NLO in the strong coupling constant \(\alpha _{\text {s}}\). The SM–EFT interference and squared EFT–EFT contributions were generated separately at leading order in \(\alpha _{\text {s}}\) using the SMEFT@NLO [82] package. Scale uncertainties in the SM and SMEFT predictions are obtained from the envelope of nine different scale variations by varying \(\mu _{\mathrm {r}} \) and \(\mu _{\mathrm {f}} \) independently by factors of 0.5 and 2.0.

5.2 Background modelling

There are two sources of background events: events with electrons and muons produced in W or \(Z/\gamma ^{\star }\) boson decays (prompt leptons, Sect. 5.2.1) and events that have one lepton arising from another source (fake and non-prompt leptons, Sect. 5.2.2). In the latter case, electrons and muons originate from heavy-flavour hadron decays, photon conversions, or improper reconstruction of other physics objects such as hadronic jets.

5.2.1 Backgrounds with prompt leptons

Background contributions with prompt leptons originate from a wide range of processes, which are estimated from simulation using the samples described below. The systematic uncertainties in the modelling of these processes by the simulation are discussed in Sect. 7.

Single-top Wt associated production was modelled using the Powheg Box [71,72,73, 83] v2 generator to provide MEs at NLO in \(\alpha _{\text {s}}\), using the five-flavour scheme with the NNPDF3.0nlo PDF set. The functional form of \(\mu _{\mathrm {r}}\) and \(\mu _{\mathrm {f}}\) was set to the nominal scale of \(\sqrt{m_{t}^2 + p_{\text {T}}^2}\). The diagram removal scheme [84] was employed to treat the interference with \(t\bar{t}\) production [75]. Dedicated samples produced with a diagram subtraction (DS) scheme [84] are used to evaluate the uncertainty due to the treatment of the overlap with \(t\bar{t}\) production.

Single-top s-channel (t-channel) production was modelled using the Powheg Box [71,72,73, 85, 86] v2 generator to provide MEs at NLO in \(\alpha _{\text {s}}\) , using the five-flavour (four-flavour) scheme with the NNPDF3.0nlo PDF set. The functional form of \(\mu _{\mathrm {r}}\) and \(\mu _{\mathrm {r}}\) was set to \(\sqrt{m_{b}^2 + p_{{\text {T}},b}^2}\) [85]. For these processes, the events were interfaced with Pythia 8.230 using the A14 tune and the NNPDF2.3lo PDF set.

Single-top uncertainties due to ISR are estimated using varied weights in the ME and in the PS. To evaluate this \(\mu _{\mathrm {r}}\) and \(\mu _{\mathrm {f}}\) were varied by a factor of 2.0 up and down. The impact of increased or decreased FSR is evaluated using PS weights, which vary the renormalisation scale for QCD emission in the FSR by a factor of 0.5 and 2.0, respectively. The impact of the PS and hadronisation model is evaluated by comparing the nominal generator sample with events produced with the Powheg Box  v2 generator at NLO in QCD. These use the five-flavour (four-flavour) scheme for tW and s-channel (t-channel) processes, and the NNPDF3.0nlo PDF set. The events are interfaced with Herwig 7.04, using the H7UE tune and the MMHT2014lo PDF set [79]. To assess the uncertainty due to the choice of matching scheme, the nominal sample is compared with a sample generated with the MG5_aMC  2.6.2 generator at NLO in QCD in the five-flavour (four-flavour) scheme for tW and s-channel (t-channel) processes, using the NNPDF3.0nlo PDF set. The events were interfaced with Pythia 8.230, using the A14 tune and the NNPDF2.3lo PDF.

QCD V+jets (\(W \)+jets and \(Z \)+jets) production was simulated with the Sherpa  v2.2.1 MC generator. In this set-up, NLO MEs for up to two jets, and leading order (LO) MEs for up to four jets, were calculated with the OpenLoops [87, 88] and Comix [89] libraries. All Sherpa samples use the nominal Sherpa PS [90], based on Catani–Seymour dipoles and the cluster hadronisation model [91], which employs a dedicated set of tuned parameters developed by the Sherpa for use with the NNPDF3.0nnlo PDF set. The V+jets samples are normalised to a NNLO prediction [92].

Diboson (\(VV\)) samples were simulated with the Sherpa  2.2.1 and Sherpa  2.2.2 PS MC generators. Sherpa  2.2.2 was used for two- and three-lepton samples. Matrix element and PS matching [93] was applied to the various jet multiplicities, which were then merged into an inclusive sample using an improved CKKW matching procedure [94, 95]. The procedure was extended to NLO using the MEPS@NLO prescription [96]. These simulations are at NLO for up to one additional parton and at LO for up to three additional parton emissions using factorised on-shell decays. The virtual QCD corrections for matrix elements at NLO were provided by the OpenLoops library [87, 88]. The calculation was performed in the \(G_\mu \) scheme, ensuring an optimal description of pure electroweak interactions at the electroweak scale.

The production of \(t\bar{t} V \) and \(t\bar{t} H \) events was modelled using the MG5_aMC  2.3.3 and Powheg Box generators, respectively. The generators provide matrix elements at NLO in \(\alpha _{\text {s}}\), with the NNPDF3.0nlo PDF set. For \(t\bar{t} V \) and \(t\bar{t} H \) production, the events were interfaced with Pythia 8.210 and Pythia 8.230, respectively. Each uses the A14 tune and the NNPDF2.3lo PDF set.

5.2.2 Backgrounds with fake or non-prompt leptons

To estimate the total contribution of events with fake or non-prompt leptons a data-driven “matrix-method” [97] is used. Two categories of events are selected, satisfying ‘baseline’ (identification only) and ‘tight’ (identification and isolation) lepton selection requirements as described in Sect. 3.1. The real (fake/non-prompt) lepton efficiency, \(\varepsilon _{\text {real}}\) (\(\varepsilon _{\text {fake}}\)), is defined as the ratio of the number of real (fake/non-prompt) leptons satisfying the tight selection to the number of real (fake/non-prompt) leptons satisfying the baseline selection.

The efficiencies for real and fake/non-prompt leptons are parametrised with respect to the \(p_{\text {T}} \) and \(|\eta |\) of the leptons and are measured separately for electrons and muons in control regions that are statistically independent of the signal region. The fake/non-prompt efficiencies are estimated using single-lepton events from data, where criteria are imposed to suppress the contribution from real leptons. The requirement on the missing transverse momentum is reversed relative to the signal region, to \(E_{\text {T}}^{\text {miss}} < 20 \,{\mathrm {GeV}}\), to reduce the contribution from W boson and \(t\bar{t}\) events. The residual contribution from real prompt-lepton events in the selected sample is estimated using a MC simulation and subtracted separately from both the tight and baseline samples before the ratio of these samples is calculated to measure the efficiency. The efficiency for fake/non-prompt electrons (muons) ranges from \(23\%\) (\(13\%\)) to \(59\%\) (\(23\%\)), depending on the lepton \(p_{\text {T}} \), \(|\eta |\) and data taking period. Any differences between the fake/non-prompt lepton efficiencies in the control region with \(E_{\text {T}}^{\text {miss}} < 20\,{\mathrm {GeV}}\) and a validation region with \(E_{\text {T}}^{\text {miss}} > 20\,{\mathrm {GeV}}\) have been found to be well within the normalisation uncertainty applied on fake/non-prompt lepton backgrounds as described in Sect. 7. The prompt- or real-lepton efficiencies are evaluated from the simulated \(t\bar{t}\) sample, corrected with scale factors measured in data [32, 33], in a control region with zero b-tagged jets. The efficiency for real electrons (muons) ranges from \(83\%\) (\(77\%\)) to \(95\%\) (\(95\%\)), depending on the lepton \(p_{\text {T}} \), \(|\eta |\) and data taking period.

The sample of events with fake or non-prompt leptons is estimated from the weighted data events, where the weight depends on the real- and fake-lepton efficiencies. As the dominant source of these events is expected to be QCD multijet production, this background is hereafter referred to as ‘Multijet’.

5.3 Event yields

Table 1 summarizes the event yields for the signal and background predictions before unfolding (pre-marginalisation) in comparison with those observed in data. The expected yield is in agreement with the observed number of events within uncertainties. The \(t\bar{t}\) signal purity is \(87\%\), out of which 70% are fiducial signal. The non-fiducial signal consists mainly of events in which the hadronic top candidate does not pass the \(p_{\text {T}} \), mass and b-flavour requirements. The most important background is W boson production in association with jets (\(W \)+jets) which contributes at the level of 5% due to the similarity of its final state and kinematics to the signal. Single-top production contributes at the level of 3% and is dominated by the Wt-channel. The Multijet yield fraction in the signal region is found to be below 2%. The uncertainty of 21% in the \(t\bar{t}\) signal prediction originates mainly from signal modelling effects, where the corresponding impact on the signal yield cancels out for the energy asymmetry observables. Most backgrounds are expected to be symmetric and this is seen within the statistical uncertainty of the simulated samples. For \(W \)+jets events the expected asymmetry is negative due to the combination of a higher cross section for \(W^+\) than for \(W^-\) production and the selected phase space preferring hadronically decaying top candidates with a larger energy than the leptonically decaying ones, which mostly include the real W bosons.

Table 1 Event yields for the simulated signal and background predictions with uncertainties as described in Sect. 7, compared with the event yield in the signal region as observed in data. The uncertainties do not add up in quadrature to the total uncertainty, due systematic uncertainties being correlated between event samples

Figure 2 shows the level of agreement between the data and the detector-level prediction for the top-quark energy and the antitop-quark energy and \(\theta _j\). All systematic uncertainties listed in Sect. 7 are included in these comparisons. The data are consistent with the predictions within uncertainties, with a predicted rate about 10% larger than the observed data due to an overestimation of the \(t\bar{t} \) yield in the boosted phase space, as reported in Refs. [45, 98]. As the charge asymmetry observable is independent of the \(t\bar{t} \) cross-section, it is not sensitive to any mismodelling in the rate of \(t\bar{t} \) production.

Fig. 2
figure 2

Comparison of observed data with the predicted distribution of a the top-quark energy, b the antitop-quark energy, and c the angle \(\theta _j\) before unfolding (pre-marginalisation). The expectations for the \(t{\bar{t}}\) fiducial and non-fiducial signal are represented by the white and grey areas, respectively, while all non-\(t{\bar{t}}\) backgrounds are added up in the blue area. The shaded red bands correspond to the total uncertainty of the prediction, while the statistical uncertainty on the data is indicated by the black error bars, that are mostly smaller than the size of the marker

6 Asymmetry extraction and unfolding

Based on the selection criteria defined in Sect. 4, the optimised event rates \(N^{\text {opt}}(\theta _j, \Delta E)\) are extracted from the data in bins of the jet scattering angle, \(\theta _j\), and the top–antitop energy difference, \(\Delta E= E_{t} - E_{{\bar{t}}}\). Here \(N^{\text {opt}}(\theta _j)\) is defined as

$$\begin{aligned} N^{\text {opt}}(\theta _j)&= N(\theta _j | y_{t{\bar{t}}j} > 0) + N(\pi - \theta _j | y_{t{\bar{t}}j} < 0)\,, \end{aligned}$$

in analogy with the optimised cross section \(\sigma ^{\text {opt}}(\theta _j)\) from Eq. (2). The selected data events are divided into three bins in \(\theta _j\), \([0,\pi /4,3/5\pi ,\pi ]\), and four bins in \(\Delta E\), \([-\infty ,-100,0,100,\infty ]\,{\mathrm {GeV}}\). The choice of binning in \(\theta _j\) optimizes the significance of the energy asymmetry in the central bin at particle level. The binning in \(\Delta E\) was chosen to account for different migration behaviour for small and large energy differences, while at the same time maintaining a high event rate in each bin. Each of the three \(\theta _j \) bins is subdivided into four \(\Delta {E} \) bins for a total of 12 bins as illustrated in Fig. 3. The chosen configuration shows the best response in linearity tests with different asymmetries in pseudo-data. To obtain \(N^\text {opt}(\theta _j | \Delta {E} > 0)\) and \(N^\text {opt}(\theta _j | \Delta {E} < 0)\) used in Eq. 4 the content of the corresponding pairs of bins are later summed.

Fig. 3
figure 3

Concatenation of optimised event rates \(N^{\text {opt}}(\theta _j,\Delta {E})\) in three bins of \(\theta _j\), \([0,\pi /4,3\pi /5,\pi ]\), and four bins in \(\Delta {E} \), \([-\infty ,-100,0,100,\infty ]\,{\mathrm {GeV}}\). For each bin in \(\theta _j\), the bins coloured blue have \(\Delta {E} < 0\), while the bins coloured red have \(\Delta E > 0\). The binning is used both at the reconstruction level R and truth level T

The sample in the 12 bins measured at the detector level is corrected for detector effects (unfolded) to particle-level fiducial phase space defined in Sect. 4. The unfolding is performed with the Fully Bayesian Unfolding (FBU) [14, 99] method, which applies Bayesian inference to obtain a full posterior probability density distribution of the unfolded asymmetry.

Given an observed spectrum \(D\in {\mathbb {N}}^{N_\text {r}}\) with \(N_\text {r}\) reconstructed bins, a ‘truth’ spectrum \(T\in {\mathbb {R}}^{N_\text {t}}\) with \(N_\text {t}\) bins and a response matrix \({\mathcal {M}} \in {\mathbb {R}}^{N_\text {r}}\times {\mathbb {R}}^{N_\text {t}}\), the posterior probability density of the true spectrum T is calculated using Bayes’ theorem:

$$\begin{aligned} P(T|D,{\mathcal {M}})\propto {\mathcal {L}}(D|T,{\mathcal {M}})\pi (T), \end{aligned}$$

where \({\mathcal {L}}(D|T,{\mathcal {M}})\) is the likelihood function of D for a given T and \({\mathcal {M}}\), and \(\pi (T)\) is the prior probability density for T. While the response matrix is estimated from the simulated sample of \(t\bar{t} \) events, a uniform prior probability density in all bins is chosen as \(\pi (T)\), such that equal probabilities are assigned to all T spectra within a wide range between zero and twice the SM prediction.

The likelihood of observing data D for various predictions of signal \(R\in {\mathbb {R}}^{N_\text {r}}\) and background \(B\in {\mathbb {R}}^{N_\text {r}}\) is given by the product of Poisson probabilities for each bin of the spectrum:

$$\begin{aligned} {\mathcal {L}}(D|T,{\mathcal {M}},B)=\prod _{i=1}^{N_\text {r}}\frac{(r_i+b_i)^{d_i}}{d_i!}\text {e}^{-(r_i+b_i)}, \end{aligned}$$

where \(r_i\) and \(b_i\) are the expected signal and background yields and \(d_i\) is the observed data yield in bin i. The true distribution T is related to the reconstructed signal distribution R via the response matrix \({\mathcal {M}}\):

$$\begin{aligned} r_i(T,{\mathcal {M}})=\sum _{j=1}^{N_\text {t}}m_{ij}t_j, \qquad m_{ij}= \frac{\epsilon _{t_j} P(r_i|t_j)}{f_{{\mathrm {acc}},r_i}} \end{aligned}$$
(3)

with efficiency \(\epsilon _{t_j}\), migration \(P(r_i|t_j)\) and acceptance \(f_{\text {acc},r_i}\). The efficiency \(\epsilon _{t_j}\) is the probability for an event produced in ‘truth’ bin \(t_j\) to be reconstructed in any bin, i.e. the combined detector acceptance, reconstruction efficiency and selection efficiency. The migration \(P(r_i|t_j)\) is the probability for an event produced in the ‘truth’ bin j to be reconstructed in bin i, given that it is reconstructed in any bin. The acceptance \(f_{\text {acc},r_i}\) is the fraction of events passing the detector-level selection in bin i that also satisfy the fiducial phase-space selection at particle level.

The migration matrix shown in Fig. 4 is mostly diagonal, with the smallest diagonal entry being 69.7%, indicating that the reconstruction is performing well. The average efficiencies and acceptances are \(\epsilon = 34.4\%\) and \(f_\text {acc} = 69.6\%\), respectively.

Fig. 4
figure 4

Migration matrix, normalised to 100% in each truth bin. This gives the probability of a reconstructed event created in truth bin j to be reconstructed in bin i

Systematic uncertainties are included in the FBU via marginalisation, i.e. by introducing \(N_{\text {np}}\) nuisance parameters which are integrated out to obtain the marginal likelihood:

$$\begin{aligned} {\mathcal {L}}(D|T)=\int {\mathcal {L}}(D|R(T;\pmb {\theta }_s),B(\pmb {\theta }_s,\pmb {\theta }_b))G(\pmb {\theta }_s)G(\pmb {\theta }_b)\text {d}\pmb {\theta }_s\text {d}\pmb {\theta }_b\text {,} \end{aligned}$$

where \(\pmb {\theta }_s\) are nuisance parameters for uncertainties in the modelling, object identification, reconstruction and calibration affecting both the signal prediction \(R(T;\pmb {\theta }_s)\) and background prediction \(B(\pmb {\theta }_s,\pmb {\theta }_b)\) and \(\pmb {\theta }_b\) are the nuisance parameters for background normalisation uncertainties. The priors \(G(\pmb {\theta _s})\) and \(G(\pmb {\theta _b})\) are Gaussian and truncated Gaussian distributions, respectively, with \(\mu =0\) and \(\sigma =1\). The truncation is applied to avoid a non-physical negative number of events in the background estimate.

The posterior probability distribution P(T|D) is determined by sampling the \((N_\text {t}+N_{\text {np}})\)-dimensional parameter space, evaluating the likelihood \({\mathcal {L}}(D|T\),\({\mathcal {M}})\,\pi (T)\) for each point, and projecting the samples to the T-parameter space. The unfolded asymmetry \(A_E (\theta _j)\) is computed from P(T|D) as

$$\begin{aligned} P(A_E (\theta _j)|D) = \int \delta (A_E (\theta _j)-A_E (\theta _j,T))P(T|D)\,{\mathrm {d}}T \end{aligned}$$

where \(A_E (\theta _j,T)\) is defined as in Eq. (1):

$$\begin{aligned} A_E(\theta _j,T) \equiv \frac{ N^\text {opt}(\theta _j | \Delta {E}> 0) - N^\text {opt}(\theta _j | \Delta {E}< 0)}{ N^\text {opt}(\theta _j | \Delta {E} > 0) + N^\text {opt}(\theta _j | \Delta {E} < 0) } \end{aligned}$$
(4)

where for \(\Delta E > 0\) and \(\Delta E < 0 \) the sum over the two corresponding bins is taken. The estimate and variance of the energy asymmetry \(A_E \) are given by the mean and variance of the posterior distribution. The marginalised posterior distributions of the nuisance parameters are given by projections on the corresponding one-dimensional nuisance parameter space. The sampling is based on Markov Chain Monte Carlo methods [100] with the No-U-turn sampler algorithm [101] in four Markov chains each with 10,000 sampling steps and 2500 tuning steps. The technical implementation uses the PyFBU code [102], which uses the PyMC3 package [103].

The unfolding method was found to be robust against various reweightings of the input distributions, varying the energy asymmetry in a range between \(-0.15\) and 0.15, and with respect to the impact of SMEFT operators within current global limits. As expected, any difference between the unfolded and input asymmetries in these tests is covered by the uncertainties of the response matrix. The final result was also found to be stable with respect to the chosen flat prior. A Gaussian prior with the SM prediction as the mean and a 20% uncertainty changes the central values by 2%–6% and results in approximately 4% smaller uncertainties, supporting the argument that the flat prior is a conservative choice.

7 Systematic uncertainties

Various experimental and theoretical systematic uncertainties affecting both the signal and background predictions are considered. A nuisance parameter with a Gaussian prior is assigned to each systematic uncertainty, unless otherwise specified, and their effect on the measurement is directly included in the unfolding procedure as described in Sect. 6. The smoothing and symmetrisation procedures are described in Sect. 7.3.

7.1 Experimental uncertainties

The uncertainty in the combined 2015–2018 integrated luminosity is 1.7% [104], obtained using the LUCID-2 detector [105] for the primary luminosity measurements. This uncertainty affects only the backgrounds estimated from simulation, since the luminosity uncertainty in the signal cancels out when measuring the energy asymmetry. A reweighting procedure based on the mean number of interactions is used to correct the pile-up profile in MC simulation to match the data. A systematic uncertainty is assigned to the weights. Lepton (\(e,\mu \)) identification, reconstruction, isolation and trigger performance [48, 49] can differ slightly between data and simulation such that scale factors are applied to correct for these differences. The scale factors were obtained by a tag-and-probe method in Z boson and \(J/\psi \) decays [32, 33]. Systematic uncertainties are assigned to each of these scale factors. The lepton momentum scale and resolution are corrected for the differences between data and simulation, and the systematic uncertainties for these corrections are taken into account.

For jets, the jet energy scale and resolution uncertainties are applied to both the small-\(R\) and large-\(R\) jets, taking correlations into account [43, 106, 107]. Further uncertainties are considered for the JVT [39], the jet mass calibration [43], and the top-tagging scale factors [44, 108] for tagged and mistagged jets.

Eigenvector variations for b-, c- and light-jet b-tagging efficiencies [109] are applied to the flavour-tagging scale factors, together with the uncertainty on the \(p_{\text {T}} \) extrapolation to the regions in which they could not be determined directly. The uncertainties of all the \(E_{\text {T}}^{\text {miss}} \) components associated with reconstructed objects are propagated in a fully correlated way. Additional uncertainties in the scale and resolution of the soft-track components are considered [46].

Three kinds of uncertainties are considered for the non-prompt- and fake-lepton backgrounds: the uncertainty in the prediction, due to the limited number of data events and large applied weights, which is uncorrelated for each unfolded bin; the statistical uncertainties of the real- and fake-lepton efficiencies for electrons and muons (four parameters); and a 50% normalisation uncertainty for this background [45].

7.2 Theoretical uncertainties

The relative uncertainties in the inclusive cross sections of backgrounds estimated from simulation are 5.3% for single-top [110, 111] and 5% for W+jets [112]. A conservative 50% normalisation uncertainty is applied for the remaining Z+jets, diboson, \(t{\bar{t}}V\) and \(t{\bar{t}}H\) background samples [45], which together represent only 3% of the total expected event yield. In data/MC comparison plots, a relative uncertainty of 6.1% is applied to the \(t{\bar{t}}\) cross section [58]. In the unfolding the amount of signal is unconstrained and not affected by this systematic uncertainty.

The effect of signal modelling uncertainties on the signal distribution R is estimated by varying the response matrix in Eq. (3) for the fixed nominal ‘truth’ distribution, since the unfolding procedure depends only on the response matrix and is performed without any assumption about the underlying ‘truth’ distribution.

Signal modelling uncertainties account for the following possible sources of systematic uncertainty:

  • parton shower and hadronisation model, estimated as the symmetrised difference between Pythia 8 and Herwig 7, both interfaced to Powheg Box;

  • matching between matrix element and parton shower, evaluated from the difference between the Powheg Box and the MG5_aMC predictions, both interfaced to Pythia 8, and from variations of the \(h_{\mathrm {damp}}\) parameter;

  • initial-state radiation, derived from varied weights in the matrix element and in the parton shower obtained in simulations with \(\mu _{\mathrm {r}} \) and \(\mu _{\mathrm {f}} \) values different from the nominal one;

  • final-state radiation, based on the variation of parton shower weights corresponding to different renormalisation scales;

  • top-quark mass used in the simulation.

The samples used to estimate these uncertainties are described in Sect. 5.1.

The uncertainties in parton distribution functions are evaluated using a set of 30 nuisance parameters described in the PDF4LHC15 prescription [113]. The PDF variations are obtained using alternative MC generator weights stored within the nominal Powheg + Pythia 8 \(t\bar{t}\) sample. The uncertainty from the limited size of the MC signal sample is estimated using a bootstrapping method outlined in Ref. [114] from the covariance matrix of the ensemble of repeated unfolding results with varied response matrices.

Fig. 5
figure 5

Comparison of data to the prior (pre-marginalisation) prediction for the \(\Delta {E} \) vs. \(\theta _j \) distribution used in the unfolding. The 3 differential bins in \(\theta _j \) and 4 \(\Delta {E} \) bins are concatenated to give 12 bins as shown in Fig. 3. The shaded red bands correspond to the total uncertainty of the prediction

The single-top systematic uncertainties are decorrelated between the s-, t- and Wt-channel, except for the PDF uncertainties, which are fully correlated across the different channels and with \(t\bar{t}\). Both the differences in shape and normalisation are taken into account, except for the PDF uncertainties, since their effect on the normalisation is already included in the cross-section uncertainty. The modelling uncertainties are the same ones as considered for \(t\bar{t}\) and are estimated with the same methods. In addition, the difference between the predictions for the Wt-channel obtained with diagram subtraction or removal is considered as a systematic uncertainty [84].

Fig. 6
figure 6

Ranking of systematic uncertainties in the three \(\theta _j\) bins. The black dots and error bars (upper scale) denote the mean and standard deviation of the posterior distribution of the nuisance parameter, normalised to its pre-marginalisation standard deviation. The blue and red boxes (lower scale) show the impact on the energy asymmetry from a \(1\sigma \) variation in its influencing nuisance parameter. The \(\gamma \) variations denote the statistical uncertainties of the background predictions in the corresponding bin of the \(\Delta {E} \) vs \(\theta _j \) distribution. The numbers appended to the \(W \)+jets PDF variations denote the corresponding NNPDF3.0 [74] PDF sets

For the \(W \)+jets background, modelling uncertainties are estimated by varying the renormalisation and factorisation scales up and down by a factor of 2, varying the matrix element to parton shower matching scale up and down by factors of 1.5 and 1.33, respectively, and varying the parton shower resummation scale up and down by a factor of 4, in each case relative to scale(s) for the nominal sample described in Sect. 5.2.1 (see Ref. [81] for further details). The uncertainties in the parton distribution functions are evaluated using a set of 100 PDF variations included in the NNPDF3.0 description [74], obtained using alternative MC generator weights stored within the nominal Sherpa 2.2.1 \(W \)+jets samples. To account for the systematic uncertainty in the fraction of \(W \)+jets events containing jets originating from b- or c-quarks, these events are filtered into separate samples and their individual normalisation is varied up and down by 30% [115].

7.3 Smoothing and symmetrisation

In order to mitigate effects due to statistical fluctuations in the MC samples used to estimate systematic uncertainties, a bootstrapping method [114] is used to remove statistically insignificant effects in the \(t\bar{t}\), single-top and \(W \)+jets modelling uncertainties. A variation in a detector-level bin corresponding to one standard deviation of the systematic uncertainty is considered to be statistically significant if it is more than twice as large as the statistical uncertainty of the expected yield in the bin. If at least one of the four \(\Delta E\) bins in a \(\theta _j\) interval has a significant variation, all four are left unchanged and no smoothing is applied. Same-sign \(\Delta E\) bins are merged and kept if the variation is statistically significant after adding them up, but no variation in the single bins is itself statistically significant. Otherwise, all four bins are combined and the same significance criterion is applied to the combination. If it is significant, all four bins are set to this combined value, effectively dropping the shape of the systematic uncertainty, but keeping its normalisation. If it is still not significant, the systematic uncertainty is neglected in the unfolding.

Since a nuisance parameter with a Gaussian prior is assigned to each of them, the uncertainties are symmetrised in each bin of the reconstruction-level distribution. For one-sided systematic uncertainties, the difference between the nominal yield and the varied yield is used as the standard deviation of the Gaussian prior. For two-sided systematic uncertainties, the average absolute difference between the nominal yield and the upper- and lower-side varied yields is used as the standard deviation.

8 Measurement results

The distribution of the data events at detector level binned in \(\Delta {E}\) and \(\theta _j\) is compared with the SM expectation in Fig. 5. The data are consistent with the predictions within uncertainties. This distribution is the input to the unfolding procedure. With the same number of detector-level bins as ‘truth’ level bins, the posterior prediction agrees with the data distribution and is thus not shown.

Table 2 Impact of systematic uncertainties by category. The impact of the individual uncertainties is estimated from the posterior distribution as the effect on a bin i of a \(1 \sigma \) variation in its influencing nuisance parameter j, \(\sigma _i^{(j)} = c_{ij}/\sqrt{c_{jj}}\). The impact of a group of uncertainties is calculated with Gaussian error propagation, taking into account correlations between uncertainties. The uncertainties in the luminosity and the cross sections of \(Z \)+jets, \(VV\), \(t\bar{t} V \) and \(t\bar{t} H \) backgrounds are found to be negligible and are not shown in the table
Table 3 The measurement (‘Data’) of the energy asymmetry in three bins of the jet angle \(\theta _j\) is compared with the ‘SM prediction’ obtained from simulations of \(t\bar{t} {j} \) events with MadGraph5_aMC@NLOPythia 8 at NLO in QCD for \(t\bar{t} {j} \) + PS, including MC statistical and scale uncertainties as described in Sect. 5.1. The table also shows the expected measurement values and uncertainties in the case of observing exactly the ‘SM prediction’ (labelled as ‘SM expectation’)

To determine the impact of each systematic uncertainty, a method of ranking is employed in Fig. 6. For each bin, the effects of \(1 \sigma \) variations in its influencing nuisance parameters are calculated. These effects are then used to rank each systematic uncertainty for the three \(\theta _j\) bins individually. The effects are extracted from the samples of the posterior distribution with \(\sigma _i^{(j)} = c_{ij}/\sqrt{c_{jj}}\) being the estimated shift of bin i due to a shift of nuisance parameter j, where \(c_{ij}\) denotes the covariance of the posterior distributions. The impact of systematic uncertainties is largest in the central bin due to the \(t{\bar{t}}\) FSR variation and the choice of matching scheme in the single-top Wt channel being significant compared to the statistical precision of the MC samples, whereas the impact is negligible in the outer bins and therefore ignored in accord with the smoothing procedure described in Sect. 7.3. The different size of the systematic effects is explained by the difference in the relative populations of the \(\Delta E\) bins in each angular interval, causing bin-to-bin migrations produced by the systematic variations to affect the asymmetry most strongly in the central \(\theta _j\) bin. The statistical uncertainties displayed in Fig. 6 are given by the uncertainty from unfolding with all nuisance parameters fixed to their post-marginalisation values. The MC statistical uncertainty of the response matrix is derived with the bootstrapping method described in Sect. 7. The impact of systematic uncertainties by category is calculated with Gaussian error propagation and shown in Table 2. The measurement is dominated by the data statistical uncertainty; the largest systematic uncertainties are from the jet energy resolution, the \(t\bar{t} \) modelling and the statistical uncertainty of the multijet estimate in the first, second and third bins, in that order. The first \(\theta _j \) bin preferentially selects jets closer to the beamline as compared to the other bins; the jet energy resolution uncertainty is therefore larger, as this uncertainty increases with the absolute value of \(\eta \) within the analysis fiducial volume [38].

The measured \(A_E\) is obtained from the unfolded \(\Delta {E}\) distributions in the three \(\theta _j\) bins and is shown in Table 3 and in Fig. 7 together with the SM prediction including MC statistical and scale uncertainties obtained as described in Sect. 5.1. In the highest \(\theta _j\) bin the uncertainty in the SM prediction is similar to the expected sensitivity of the measurement and dominated by scale uncertainties. The measured energy asymmetry is in good agreement with the SM prediction with a combined p-value of 0.80. In the central bin, the measured (expected) energy asymmetry differs from zero by 2.1 (1.9) standard deviations. While the systematic uncertainties of the total cross section dominate for the \(\Delta {E}\) and \(\theta _j\) distribution in Fig. 5, these uncertainties cancel out in the \(A_E\) measurement. The correlations between \(A_E\) values measured in pairs \(\theta _j\) bins are always smaller than 5% in magnitude.

Fig. 7
figure 7

Data measurements (black points with errors) and predictions (blue lines) of the energy asymmetry in three bins of the jet angle \(\theta _j\). The SM prediction was obtained from simulations of \(t\bar{t} {j} \) events with MadGraph5_aMC@NLOPythia 8 at NLO in QCD for \(t\bar{t} {j} \) + PS, including MC statistical and scale uncertainties (blue shaded bands) as described in Sect. 5.1

9 SMEFT interpretation

The sensitivity of the energy asymmetry to new physics above a scale \(\Lambda \) is investigated in the framework of the Standard Model effective field theory (SMEFT). To this end, the SM Lagrangian \({\mathcal {L}}_\text {SM}\) is extended by a series of local operators \(O_k\),

$$\begin{aligned} {\mathcal {L}}_\text {SMEFT} = {\mathcal {L}}_\text {SM} + \sum _k \frac{C_k}{\Lambda ^2}\,O_k + \dots \,. \end{aligned}$$
(5)

The sum runs over all SM gauge-invariant operators of mass dimension six. The ellipsis indicates higher-dimensional operators in the SMEFT expansion, which are neglected in this analysis. The analysis also assumes CP invariance, which implies that all Wilson coefficients \(C_k\) are real. Setting all the Wilson coefficients to zero corresponds to the SM. The cut-off scale of this effective theory is set to \(\Lambda = 1\,{\mathrm {TeV}}\) throughout the analysis.

Fig. 8
figure 8

Energy asymmetry \(A_E\) in three \(\theta _j\) bins, \(A_E^1\) (black, dashed), \(A_E^2\) (blue, solid) and \(A_E^3\) (green, dotted), as a function of individual Wilson coefficients. In each sub-figure, only one coefficient is considered, while all other coefficients are set to zero

In the Warsaw basis [116], 15 dimension-six operators affect \(t{\bar{t}}j\) production at tree level [19]. The energy asymmetry is particularly sensitive to the chirality and the colour charges of the involved quark fields. To demonstrate this sensitivity, the focus of this numerical analysis is on six selected four-quark operators [117]

$$\begin{aligned} O_{Qq}^{1,8}&= ({\bar{Q}}_\text {L}\gamma _\mu T^A Q_\text {L})({\bar{q}}_\text {L}\gamma ^\mu T^A q_\text {L}) \nonumber \\ O_{Qq}^{1,1}&= ({\bar{Q}}_\text {L}\gamma _\mu Q_\text {L})({\bar{q}}_\text {L}\gamma ^\mu q_\text {L}) \nonumber \\ O_{tu}^8&= ({\bar{t}}_\text {R}\gamma _\mu T^A t_\text {R})({\bar{u}}_\text {R}\gamma ^\mu T^A u_\text {R}) \nonumber \\ O_{tu}^1&= ({\bar{t}}_\text {R}\gamma _\mu t_\text {R})({\bar{u}}_\text {R}\gamma ^\mu u_\text {R}) \nonumber \\ O_{tq}^8&= ({\bar{q}}_\text {L}\gamma _\mu T^A q_\text {L})({\bar{t}}_\text {R}\gamma ^\mu T^A t_\text {R}) \nonumber \\ O_{tq}^1&= ({\bar{q}}_\text {L}\gamma _\mu q_\text {L})({\bar{t}}_\text {R}\gamma ^\mu t_\text {R})\,. \end{aligned}$$
(6)

Left-handed quark doublets of the first two generations and the third generation are denoted by

$$\begin{aligned} q_\text {L}&= (u_\text {L},d_\text {L})^\top \,,\ (c_\text {L},s_\text {L})^\top \,,\qquad Q_\text {L} = (t_\text {L},b_\text {L})^\top \,, \end{aligned}$$

reflecting an assumed flavour symmetry among couplings of the first and second quark generations. These six operators feature different chiral structures (LL, RR, LR) and colour structures (singlet (1) and octet (8)), thus covering the relevant properties that can be tested with the energy asymmetry. Operators with right-handed down-type quarks \(d_\text {R}\) would affect the energy asymmetry very similarly to those with up-type quarks and are therefore not included in this analysis. Up- and down-quark operators can be resolved for instance with \(t{\bar{t}}Z\) or \(t{{\bar{t}}} W\) observables, which are sensitive to the weak isospin of the quarks [22].

The Wilson coefficients in the effective Lagrangian in Eq. (5) parameterize possible effects of virtual new heavy particles in the cross section \(\sigma \) and in asymmetry observables A according to

$$\begin{aligned} \sigma&= \sigma _\text {S}^{\text {SM}} + \sum _k\,C_k\sigma _\text {S}^k + \sum _{k\le l} C_k C_l\,\sigma _\text {S}^{kl}\,, \nonumber \\ A&= \frac{\sigma _\text {A}}{\sigma _\text {S}} = \frac{ \sigma _\text {A}^{\text {SM}} + \sum _k C_k \sigma _\text {A}^{k} + \sum _{k\le l} C_k C_l \, \sigma _\text {A}^{kl}}{ \sigma _\text {S}^{\text {SM}} + \sum _k C_k \sigma _\text {S}^{k} + \sum _{k\le l} C_k C_l\, \sigma _\text {S}^{kl}}\,. \end{aligned}$$
(7)

The cross sections \(\sigma _\text {S,A}^{\text {SM}}\) denote the SM contributions, while \(\sigma _\text {S,A}^k\) and \(\sigma _\text {S,A}^{kl}\) correspond to the interference of a SMEFT amplitude with the SM amplitude at \({\mathcal {O}}(\Lambda ^{-2})\) and the pure SMEFT amplitudes at \({\mathcal {O}}(\Lambda ^{-4})\), respectively. For the energy asymmetry the charge-symmetric and charge-asymmetric cross sections \(\sigma _\text {S}\) and \(\sigma _\text {A}\) are defined as

$$\begin{aligned} \sigma _\text {S}&= \sigma ^\text {opt}(\theta _j | \Delta {E}> 0) + \sigma ^\text {opt}(\theta _j | \Delta {E}< 0) \\ \sigma _\text {A}&= \sigma ^\text {opt}(\theta _j | \Delta {E} > 0) - \sigma ^\text {opt}(\theta _j | \Delta {E} < 0)\,, \end{aligned}$$

with \(\sigma ^\text {opt}\) defined as in Eq. (2). The SM prediction and SMEFT contributions are computed in \(t\bar{t} {j} \) production at NLO and LO in QCD, respectively, as described in Sect. 5.1. Relative to the SM predictions \(\sigma _\text {S,A}^{\text {SM}}\), the four-quark operator contributions \(\sigma _\text {S,A}^k\) and \(\sigma _\text {S,A}^{kl}\) grow at high energies as \((m_{t{{\bar{t}}}}/2m_t)^2\) and \((m_{t{{\bar{t}}}}/2m_t)^4\), respectively [22]. In this analysis, the energy asymmetry is defined in the boosted regime, which probes invariant masses in the range \(600 \lesssim m_{t{{\bar{t}}}} \lesssim 1200~\text {Gev}\), well above the \(t{{\bar{t}}}\) threshold. The energy enhancement partially cancels out in the normalised asymmetry, but is still present due to the growth of the charge-asymmetric cross section \(\sigma _\text {A}\) in \(\Delta E\) at high energies.

Figure 8 shows the energy asymmetry as a function of individual Wilson coefficients; \(A_E^1\), \(A_E^2\) and \(A_E^3\) correspond to the three bins in \(\theta _j\), \([0,\pi /4,3/5\pi ,\pi ]\), respectively. To assess the sensitivity of the energy asymmetry to the various operator contributions, \(\chi ^2\) fits of the SMEFT predictions \(x^{\text {SMEFT}}\) to the measured energy asymmetry \(x^{\text {d}}\) in the three jet-angle bins \(x = (A_E^1,A_E^2,A_E^3)^\top \) are performed, fitting one or two coefficients at a time, using

$$\begin{aligned} \chi ^2 = (x^{\text {d}} - x^{\text {SMEFT}})^\top C^{-1} (x^{\text {d}} - x^{\text {SMEFT}})\,. \end{aligned}$$

Correlations among the measurements and among the predictions are taken into account by the covariance matrix

$$\begin{aligned} C = C^{\text {d}} + C^{t}, \qquad C^{t} = C^{\text {MC stat}} + C^{\text {scale}}, \end{aligned}$$

which contains the measurement uncertainties \(C^\text {d}\) and the prediction’s theoretical uncertainties \(C^\text {t}\) due to both the MC sample size, \(C^{\text {MC stat}}\), and the renormalisation and factorisation scale uncertainties, \(C^{\text {scale}}\). The predictions \(x^{\text {SMEFT}}\) are functions of the Wilson coefficients \(C_k\), shown in Eq. (7), that enter the fit as parameters. The corresponding uncertainties associated with \(\sigma _\text {S,A}^k\) and \(\sigma _\text {S,A}^{kl}\) are contained in \(C^{\text {MC stat}}\) and \(C^{\text {scale}}\). The covariance matrix for the scale uncertainties in the SM and SMEFT contributions to \(x^{\text {SMEFT}}\) in each \(\theta _j\) bin is obtained from the envelope of nine different settings for the renormalisation and factorisation scales, varied independently by factors of 0.5 and 2.0 and assuming no correlation between the bins. The choice of no bin-to-bin correlation has negligible impact on the result, since theoretical uncertainties have only a small impact on this observable, except in the last \(\theta _j\) bin.

Figure 9 shows the bounds on individual Wilson coefficients obtained from combined fits to \(A_E\) in all three \(\theta _j\) bins with only one operator contributing at a time. The corresponding 68% and 95% confidence level (CL) intervals are listed in Table 4. Overall, the energy asymmetry is sensitive to four-quark operator coefficients in the range of \([-2,2]\,(\text {TeV}/\Lambda )^2\) at 95% CL, with the precise bounds varying for individual operators.

Fig. 9
figure 9

Bounds on individual Wilson coefficients from the energy asymmetry, obtained from a combined fit to the measured (black) and expected (blue) energy asymmetry in all three \(\theta _j\) bins. The bounds on \(C\,(\,{\mathrm {TeV}}/\Lambda )^2\) are reported at the 68% CL (solid) and 95% CL (dashed)

Table 4 Bounds on individual Wilson coefficients \(C\,(\,{\mathrm {TeV}}/\Lambda )^2\) from the energy asymmetry, obtained from a combined fit to its values in all three \(\theta _j\) bins: \(A_E^1\), \(A_E^2\), and \(A_E^3\). The SMEFT predictions include operator contributions to \(\sigma _\text {S}\) and \(\sigma _\text {A}\) up to \({\mathcal {O}}(\Lambda ^{-4})\) (left) and up to \({\mathcal {O}}(\Lambda ^{-2})\) (right)

To estimate the relative impact of \({\mathcal {O}}(\Lambda ^{-2})\) versus \({\mathcal {O}}(\Lambda ^{-4})\) operator contributions, Table 4 shows the bounds obtained from the energy asymmetry when including operator contributions to the charge-symmetric and charge-asymmetric cross sections \(\sigma _\text {S}\) and \(\sigma _\text {A}\) up to \({\mathcal {O}}(\Lambda ^{-4})\) (left two columns) and up to \({\mathcal {O}}(\Lambda ^{-2})\) (right two columns). Comparing these two scenarios, it is clear that the bounds are dominated by the \({\mathcal {O}}(\Lambda ^{-4})\) contributions \(\sigma _\text {S,A}^{kl}\). The results at \({\mathcal {O}}(\Lambda ^{-2})\) are listed here only to assess the relevance of the various operator contributions.

Due to its sensitivity to four-quark operators the energy asymmetry will provide additional constraining power when included in global SMEFT fits. Recent global fits of top-quark observables lead to stronger bounds on four-quark coefficients, see for instance Figure 5.7 in Ref. [21], Figure 13 in Ref. [22] or Figure 5 in Ref. [23]. This is due to charge-symmetric observables such as the invariant mass distribution \(\text {d}\sigma /\text {d} m_{t{{\bar{t}}}}\), where the sensitivity to four-quark operators is strongly enhanced at high energy.

The main asset of the energy asymmetry is its ability to resolve new directions in the SMEFT parameter space. In particular, charge asymmetries are very sensitive to the quark chiralities and the colour structure of effective interactions. The analytic dependence of the energy asymmetry has been studied in detail in Ref. [19]. To explore the numerical sensitivity to the chiral and colour structure of the six operators from Eq. (6), fits to pairs of Wilson coefficients were performed. Figure 10 shows the bounds on these coefficients from a combined fit of the energy asymmetry in all three jet-angle bins. Most bounds are centred near (0, 0), due to good agreement between the measurement and the SM prediction.

The various pairs of operators have been chosen in order to investigate their chiral structure and colour structure separately. Figure 10a, b show colour-singlet operators with different quark chiralities. Figure 10c, d show the same chirality scenarios as in Fig. 10a, b, but for colour-octet operators. Figure 10e, f show operators that differ only in their colour structure. The main features observed are discussed below; details about the operator dependence of the asymmetry can be found in Ref. [19].

Fig. 10
figure 10

Bounds on Wilson coefficients from two-parameter fits of the energy asymmetry \(A_E\) in all three \(\theta _j\) bins. The inner green and outer yellow areas show the 68% and 95% CL limits obtained from the measured energy asymmetry, respectively. The black cross denotes the minimum of the \(\chi ^2\) fit. Solid and dashed blue contours show the expected 68% and 95% CL limits from the energy asymmetry, respectively

In Fig. 10a, b, the two operators differ only by the chirality of the top quark (left) and the chirality of the up quark (right). For operators with left- versus right-handed tops (left), a blind direction is observed along \(C_{Qq}^{11} = \pm C_{tq}^1\). This blind direction is slightly broken by the linear SMEFT contribution of \({\mathcal {O}}(\Lambda ^{-2})\) (see Equation (3.17) in Ref. [19]). For operators with different initial quark chiralities (right), the blind direction is broken at \({\mathcal {O}}(\Lambda ^{-4})\), because the operators \(O_{Qq}^{1,1}\) and \(O_{tu}^{1}\) do not interfere with each other in the limit of massless incoming quarks. Colour-octet operators, shown in Fig. 10c, d, interfere with QCD. This effect lifts the blind direction in the \((C_{Qq}^{18},C_{tq}^8)\) plane.

The bounds for colour singlets are generally stronger than for colour octets. This can be seen by comparing the bounds in Fig. 10a, b with those in Fig. 10c, d or by comparing the bounds on each of the coefficients in Fig. 10e, f. The different sensitivity is due to colour factors in the amplitude, which lead to a larger contribution of colour-singlet coefficients to the asymmetry.

Due to the extra jet in \(t{{\bar{t}}} j\) production, the QCD structure of the energy asymmetry is not the same as for the rapidity asymmetry in \(t{{\bar{t}}}\) production. Indeed, the two asymmetries probe different directions in chiral and colour space. This complementarity can be estimated by comparing Fig. 10 with Figure 6 in Ref. [19], which shows the expected bounds from the energy asymmetry compared with the expected bounds from the rapidity asymmetry. For the colour-singlet operators in Fig. 10a, b, the two asymmetries probe similar areas in the parameter space. For the colour-octet operators in Fig. 10c, d, however, the shapes of the bounds look very different. In particular, the rapidity asymmetry leaves a blind direction in the \((C_{Qq}^{18},C_{tq}^8)\) plane, which is broken by the energy asymmetry due to operator interference with the QCD amplitude as shown in Fig. 10c. The different shapes of the bounds in Fig. 10e, f are due to the colour-singlet and colour-octet contributions differing between \(t{\bar{t}}\) and ttj production, which is probed by the asymmetries with a high sensitivity.

This comparison demonstrates once more the relevance of the energy asymmetry for global fits. Combining the energy and rapidity asymmetries in a joint fit would result in a better resolution of the Wilson coefficients in the SMEFT parameter space. Moreover, the energy asymmetry resolves remaining blind or nearly blind directions left by the rapidity asymmetry and other top-quark observables. In global analyses the energy asymmetry can thus be a useful tool to gain information about the chiral and colour structure of possible UV theories.

10 Summary

This paper reports the first measurement of the energy asymmetry in jet-associated top-quark pair production in the boosted topology in the lepton+jets decay channel, using \(139\,{\mathrm {fb}}^{-1}\) of \(\sqrt{s}=13\,{\mathrm {TeV}}\) proton-proton collision data collected by the ATLAS experiment at the LHC. The measurement was performed differentially in three bins of the jet scattering angle \(\theta _j\). A fully Bayesian unfolding method was used to correct for detector resolution and acceptance effects.

The differential measurements of the energy asymmetry in all three \(\theta _j\) bins are consistent with the SM predictions calculated at NLO in QCD, including effects of parton showering and hadronisation. In the second bin, \(\pi /4< \theta _j < 3\pi /5\), with jet emission perpendicular to the beam axis, the energy asymmetry is measured as \(A_E^2 = -0.043\pm 0.020\), in good agreement with the QCD prediction of \(-0.037\pm 0.003\). The measurement uncertainty is dominated by the limited data sample in the boosted phase-space region.

To determine the sensitivity of the energy asymmetry to new physics, this observable was analyzed within the SMEFT framework. The energy asymmetry is highly sensitive to the chiral and colour structure of four-quark operators with top quarks. Based on SMEFT predictions in terms of dimension-six operators, bounds on the Wilson coefficients of individual operators are extracted.

The energy asymmetry in \(t\bar{t} {j} \) production has a sensitivity to some individual four-quark operators that is similar to that of other observables in the top sector and probes new directions in the parameter space of Wilson coefficients. The sensitivity to these new directions is illustrated in the two-parameter fits of operator pairs with different chiral and colour structures. These results show that the energy asymmetry complements the sensitivity of other observables, such as differential cross sections and the rapidity asymmetry, and hence that the energy asymmetry is a valuable new observable in global SMEFT fits.